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(57) Abstract 

The APC tumor suppressor protein binds to /?^catenin, a protein recently shown to interact with Tcf/Lef transcription .factors Here, 
the ge^en^n^ that is expressed in colonic epithelium (hTcH) was cloned and ch * ac ^ 

SSSn^whcn associated with /^atenin. Nuclei of APC^ colon carcinoma cells were found to contom a stable ^f™;™?~* 
SSw» constitutive* active, as measured by transcription of a Tcf reporter gene. Reintroduce of ^ C K remov ^/ A ^ " ^ 
^^XS^A the transcriptional transactivaUon. Constitutive transcription of TCF target genes, caused by loss of APC function 
may be a crucTeVent in the eariy transformation of colonic epithelium. It is also shown here that the products of mutant APC genes 
round Tm coS^ are defective in regulating ^ateninATcf-4 transcriptional activation. Furthermore, colorectal tumors with intact 
^^^^Z to contain subtle activating mutations of ^atenin that altered functionally significant phosphorylation sites. Ttae 
S Wi^tffitation of /^atenin is critical to APC's tumor suppressive effect and that this regulate can be circumvented by 
mutations in either APC or ^-catenin. 
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(i-CATENIN, TCF-4, AND APC INTERACT 
TO PREVENT CANCER 



Hie U.S. Government has a paid-up license in this invention 
5 and the right in limited circumstances to require the patent owner to license 

others on reasonable terms as provided for by the terms of grant CA57345 
awarded by the National Institutes of Health. 
TECHNICAL, FIELD OF TTTE TNVFNTTfW 

This invention is related to the field of cancer diagnostics and 

10 therapeutics. More particularly it relates to methods for diagnosing and 

treating cancers associated with APC or p-catenin mutations. 
BACKGROUND OF THE INVENTION 

Mutations of the adenomatous polyposis coli (APC) gene are the most 
common disease-causing genetic events in humans; approximately 50% of 

15 the population will develop colorectal polyps initiated by such mutations 

during a normal life span (14). Individuals who inherit APC mutations 
develop thousands of colorectal tumors, consistent with APC's tumor 
suppressor or "gatekeeping" role in colorectal tumorigenesis (15,16). APC 
homodimerizes through its amino-terminus (77), and interacts with at least 

20 six other proteins: |3-catenin (18), y-catenin (plakoglobin) (19), tubulin (20), 

EB1 (27), hDLG, a homologue of a Drosophila tumor suppressor protein 
(22), and ZW3/GSK3P kinase (25). Whether any of these interacting 
proteins communicate APC growth-controlling signals is unknown. Thus 
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there is a need in the art for a fuller understanding of how the tumor 
suppressor gene APC functions in cells. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide human nucleotide 
sequences encoding transcriptional activation proteins. 

It is another object of the present invention to provide isolated 
preparations of transcriptional activation proteins. 

It is an object of the present invention to provide methods of 
determining the presence or absence in a cell of wild-type APC or a 
downstream protein in the APC transcription regulatory pathway. 

Another object of the invention is to provide methods of identifying 
candidate drugs for use in Familial Adenomatous Polyposis (FAP) patients 
or patients with increased risk of developing cancer. 

It is yet another object of the invention to provide methods of 
identifying candidate drugs for the treatment of cancer patients, in particular 
those with APC or P-catenin mutations. 

Another object of the invention is to provide a method for diagnosing 
cancer in a sample suspected of being neoplastic. 

Another object of the invention is to provide a method for treating 
a patient with colorectal cancer or other cancer associated with FAP. 

These and other objects of the invention are achieved by providing 
one or more of the embodiments described below. In one embodiment of 
the invention an intron-free DNA molecule is provided which encodes Tcf-4 
protein as shown in SEQ ID NO: 2 or 4. 

According to another embodiment of the invention an isolated Tcf-4 
protein is provided. The protein is substantially free of other human 
proteins, and has a sequence as shown in SEQ ID NO: 2 or 4. 

In another embodiment of the invention a method is provided for 
determining the presence or absence in a cell of wild-type APC or a 
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downstream protein in the APC transcription regulatory pathway. The 
method comprises the steps of: 

introducing a Tcf-responsive reporter gene into the cell; and 
measuring transcription of said reporter gene; wherein a cell 
5 which supports active transcription of said reporter gene does not have wild- 

type APC or does not have a wild-type downstream protein in the APC 
transcription regulatory pathway. 

According to yet another embodiment of the invention a method is 
provided for determining the presence or absence in a cell of wild-type 
10 APC. The method comprises the steps of : 

contacting a Tcf-responsive reporter gene with a lysate of the 

cell; and 

measuring transcription of said reporter gene; wherein a lysate 
which inhibits said transcription has wild-type APC. 
15 In still another embodiment of the invention a method of identifying 

candidate drugs is provided. The drugs may be useful for treatment of FAP 
or other cancer patients or patients with increased risk of developing cancer. 
The method comprises the steps of: 

contacting a cell having no wild-type APC or a mutant P- 
20 catenin with a test compound; 

measuring transcription of a Tcf-responsive reporter gene, 
wherein a test compound which inhibits the transcription of the reporter gene 
is a candidate drug for cancer therapy. 

According to yet another aspect of the invention another method is 
25 provided for identifying candidate drugs for use in for use in FAP patients, 

colon cancer patients, patients with mutations in P-catenin or APC, or 
patients with increased risk of developing cancer. The method, comprises 
the steps of: 
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contacting a Tcf-responsive reporter gene with a test 
compound under conditions in which the reporter gene is transcribed in the 
absence of the test compound; and 

measuring transcription of the Tcf-responsive reporter gene; 
5 wherein a test compound which inhibits said transcription is a candidate drug 

for cancer therapy . 

According to another aspect of the invention a method is provided for 
identifying candidate drugs for use in FAP patients or patients with increased 
risk of developing cancer. The method comprises the steps of: 
10 contacting a test compound with P-catenin and Tcf-4 under 

conditions in which P-catenin and Tcf-4 bind to each other, and 

determining whether the test compound inhibits the binding 
of P-catenin and Tcf-4, a test compound which inhibits the binding being a 
candidate for cancer therapy or prophylaxis. 
15 According to still another embodiment of the invention a method is 

provided for diagnosing cancer in a sample su?:pected of being neoplastic, 
the method comprises the steps of: 

comparing a CTNNB sequence found in the sample to a 
second CTNNB sequence found in a normal tissue, wherein a difference 
20 between the first and second sequence is an indicator of cancer. 

According to another aspect of the invention a method is provided for 
treating a patient with colorectal cancer or other cancer associated with FAP. 
The method comprises the step of: 

administering to the patient a nucleotide sequence comprising 
25 a portion of the APC coding sequence, said portion consisting of the P- 

catenin binding site. 

According to another aspect of the invention a method is provided for 
treating a patient with colorectal cancer or other cancer associated with FAP. 
The method comprises the step of: 



administering to the patient a polypeptide comprising a 
portion of the APC coding sequence, said portion consisting of the P-catenin 
binding site. 

TTie present invention thus provides the art with diagnostic, therapeutic 
and drug discovery methods especially useful for FAP and other cancers 
with ^PCor P-catenin mutations. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1. Sequence comparison of hTcf-4 and hTcf-1. 
Two alternative splice, forms of hTcf-4 were identified, each encoding 
a different COOH-terminus. One form (hTcf-4E) was homologous to 
hTCF-IE (top) (7); the other form (hTcf-4B) was homologous to hTcf-lB 
(bottom). The highly conserved NH^terminal interaction domain and the 
High Mobility Group (HMG) box DNA-bindirig region are boxed. 
Abbreviations for the amino acids are: A. Ala; C, Cys; D, Asp; E, Glu; 
F, Phe; G, Gly; H, His; I, He; IC, Lys; L, Leu; M, Met; N, Asn; P, Pro; 
Q, Gin; P, Ar g; S, Ser; T, Thr; V, Val; W, Tip; and Y, Tyr. The 
nucleotide sequence has been deposited in GenBank (accession number: ) 

Fig. 2. Analysis of hTcf-4 expression in colonic epithelium. 

(Fig. 2A) Northern blot analysis of hTcf-4, hTcf-1, hLef-I 
expression in Jurkat T cells (lane i) ; colonic mucosa (lane 2); colon 
carcinoma cell lines DLD-1 (lane 3), HCT116 (lane 4); SW480 (lane 5); 
SW620 (lane 6); HT29 flane 7). Lane 2 contains 5 ^g total RNA; all 
others contain 15 fig total RNA. The positions of 18S and 28S ribosomal 
RNAs are shown. EtBr, ethidium bromide stain. (Fig. 2B) In situ 
hybridization of healthy human colon tissue to an hTcf-4 probe. (Fig. 2C) 
In situ hybridization to a negative control probe (a fragment of the E. coli 
neomycin resistance gene). 
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Fig. 3. Transactivational properties of P-catenin/hTcf-4. 
All reporter assays were performed as duplicate transfections. For 
each condition, both values are shown. (Fig. 3A) Reporter gene assays in 
5 IIA1.6 B cells. Cells were transfected by electroporation with 1 jig 

luciferase reporter plasmid, 5 gtg P-catenin expression plasmid, and 3 II- 
hTcf-4 expression plasmids. Empty pCDNA was added to a total of 10 pg, 
plasmid DNA. (Fig. 3B) Reporter gene assays in SW480 colon carcinoma 
cells. Cells were transfected with 0.3 ftg, of the indicated luciferase reporter 

10 gene, 0.7 ftg pCATCONTROL as internal control, the indicated amounts of 

pCMVNeoAPC, and empty PCDNA to a total of 2.5 fig plasmid DNA. 
Control CAT values are given in the right panel. 

Fig. 4. Constitutive presence of P-catenin-hTcf-4 complexes in APC 7 ' 
cells. Gel retardation assays were performed on nuclear extracts from the 

15 indicated cell lines before and after a 20-hour exposure to Zn + + . Samples 

in lanes 1, 4, 7, 10 were incubated under standard conditions. To the 
samples in lanes 2, 5, 8, 11, 0.25 /*g, anti P-catenin was added. To the 
samples in lanes 3, 6, 9, 12, 0.25 fig of a control (human CD4) antibody 
was added. N.S., nonspecific band also observed with mutant (nonbinding) 

20 probe (lane Mt). 

Fig. 5. Effects of APC mutations on CRT. (Fig. 5A) Schematics of 
wild-type (WT) and mutant APC, APC is a 2843-amino-acid (AA) protein 
(32) with contains armadillo (ARM) repeats in the amino-terminus (33), 15 
and 20 AA P-catenin-binding repeats in the central region (18,19), and a 

25 basic region in the carboxyl-terminus (32). The carboxyl-terminus also 

contains a TXV sequence which mediates DLG binding (22). (Fig. SB) 
Effects of WT and mutant APC on CRT. SW480 cells containing 
endogenous mutant APC were transfected with the APC expression vectors 
shown in (Fig. 5A) and CRT was measured. Cells were transfected with 

30 increasing amounts of WT APC (0, 0.15 and 0.5 fig) or 0.5 /ig mutant 
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APC. CRT reporter activities are expressed relative to assays containing no 
WT APC and are the means of three replicates. Error bars represent 
standard deviations. 

lipofectamine was used to cotransfect SW480 cells with an internal 
5 control (0.5 fig pCMV-pgal), a reporter construct (0.5 fig pTOPFLASH or 

pFOPFLASH) and the indicated amount of the various APC expression 
vectors. The pTOPFLASH reporter contained an optimized Tcf-binding site 
5' of a luciferase reporter gene, whereas pFOPFLASH contained a mutated 
site that does not bind Tcf . The amount of DNA in each transfection was 

10 kept constant by addition of an appropriate amount of empty expression 

vector (pCEP4). Luciferase and P-galactdsidase activities were determined 
16 hours after transfection. Luciferase activity was corrected for transfection 
efficiency (using the control P-galactosidase activity) and nonspecific 
transcription (using the pFOPFLASH control). 

15 Fig. 6. Evaluation of CRT in colorectal cancer cell lines with WT 

APC. (Jig. 6A) Immunoblot of endogenous APC in the DLD1, SW480, 
HCT116, SW48 and 293 cell lines, developed with APC monoclonal 
antibody FE0 (34). (Fig. 6B) Effects of exogenous WT APC on CRT in 
cell lines with endogenous mutated or WT APC. Cells were transfected 

20 with increasing amounts (0, 0.15 fig, 0.5 fig for DLD1 and SW48; 0, 0.5 

fig, 5 fig for HCT116) of WT APC or APC1309A mutant (0.5 fig for DLD1 
and SW48; 5 fig for HCT116) and CRT was assessed as in Fig. 5. CRT 
reporter activities are expressed relative to activity in extracts without 
exogenous APC and are the means of three replicates. Error bars represent 

25 standard deviations. 

Fig. 7. Evaluation of P-catenin in colorectal cancer cell lines with WT 
APC. (Fig. 7A) Immunoblot of the cell lines used in this study, developed 
with P-catenin monoclonal C 19220 (Transduction Laboratories, Lexington, 
KY)(J7). (Fig. 7B) Sequence of CTNNB1 in HCT116 and SW48 . 

30 Overlapping segments constituting the entire CTNNB1 were amplified by 
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RT-PCR from SW480, DLD1, HCT116, and SW48 cells, and sequenced 
directly with ThermoSequenase (Amersham). In the case of HCT116, a 
PCR product containing the deleted region was also cloned into pCI-neo 
(Promega, Madison) and multiple clones corresponding to each allele were 
individually sequenced. 

The left panel (nts 121 to 143 from HCT116) reveals the presence of 
a deletion in addition to the WT sequence. The middle panel (antisense 
strand 156 to 113 of the WT and deleted alleles of HCT116) reveals the 3-bp 
deletion (ATCT) that removed codon 45 in half the clones. The right panel 
(nts 80 to 113 from SW48) reveals a C to A transition affecting codon 33 
(TCT to TAT). (Fig. 7C) Schematic of P-catenin illustrating the armadillo 
repeats (33) and negative regulatory domain. The residues in larger type fit 
the consensus sequence for GSK3p phosphorylation (29) and those in bold 
have been demonstrated to affect down regulation of P-catenin through 
GSK3P phosphorylation in Xenopus embryos (27). The five mutations 
found in human colon cancers are indicated at the top. 

Fig. 8. Functional evaluation of P-catenin mutants. (Fig. 8A) 
Constitutive nuclear complex of P-catenin and Tcf in HCT1 16 cells. The 
presence of nuclear p-catenin-Tcf complexes was assessed by gel shift 
assays. Lanes 1 to 3 ? optimal Tcf retardation probe shifted with nuclear 
extract from HCT116 cells with addition of no antibody (lane 1), anti 
P-catenin (0.25 /ig, lane 2), or an irrelevant antibody (0.25 fig, lane 3). 
Lane 4, mutant Tcf retardation probe shifted with nuclear extract from 
HCT116 cells. n.s M nonspecific shifting seen with the mutant probe. (Fig. 
8B) Effects of the P-catenin mutations on CRT. 293 cells were transfected 
with WT (WT) or mutant (A45, S33Y) p-catenin and CRT was assessed. 
CRT reporter activities are expressed relative to WT P-catenin and are the 
means of three replicates. Error bars represent standard deviations. 
P*catenin expression constructs were prepared as follows. WT CTNNB1 was 
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amplified by RT-PCR from SW480 cells and cloned into the mammalian 
expression vector pCI-neo (Promega) to produce pCI-neo-P-cat. The 
pCI-neo-P-cat A45 and S33Y were generated by replacing codons 1 to 89 in. 
pCI-neo-Prcat with a PCR product encoding the equivalent region from 

5 HCT116 or SW48 cDNA, respectively. The structures of all constructs 

were verified by sequence analysis. Lipofectamine was used to cotransfect 
293 cells with an internal control (0.1 /xg CMV-pgal), a reporter (0.5 /*g 
pTOPFLASH or pFOPFLASH), a Tcf-4 expression vector (0.5 fig 
pCDNA-TCF4), and P-catenin (0.5 fig) or dominant negative hTcf-4 1.0 /ig) 

10 expression vectors. CRT was determined as described above. 



DETATLFD DFSC 

It is a discovery of the present invention that hTcf-4 binds to P- 
catenin and activates transcription in colorectal qpithelial cells. Moreover, 

15 it has now been found that APC regulates this transcriptional activation, at 

least in part by binding to P-catenin. In colorectal cancer cells this 
regulation is frequendy abrogated, either by mutation of APC or by 
mutation of p-catenin. 

Two alternative splice forms of human Tcf-4 have been found. 

20 One form (hTcf-4E) is homologous to hTcf-lE and the other (hTcf-4B) is 

homologous to hTcf-lB. The sequence of the nucleotide and amino acid 
sequences are shown in SEQ ID NOs: 1-4. The coding sequences and 
proteins can be used in assays as described below. Intron-free DNA 
molecules are provided which are originally made by reverse transcription 

25 of a mRNA molecule. They can be propagated in cells or amplified as is 

desired. Isolated Tcf-4 proteins can be provided substantially free of other 
human proteins if, for example, the nucleotide sequences are expressed in 
non-human cells. Methods and vectors for achieving such expression are 
well known in the art. Choice of such expression means is made by the 

30 skilled artisan according to the desired usage and convenience. 
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Cells can be tested to determine if they have a wild-type APC or a 
wild-type downstream protein in the APC transcription regulatory pathway, 
called herein the CRT pathway ( P-catenin/Tcf-regulated transcription). One 
protein within the CRT pathway which has been identified as a target of 
5 mutations in human cancers is P-catenin (encoded by the CTNNB1 gene). 

Other parts of the pathway are also likely to be targets. Although the target 
genes of the CRT pathway have not been identified, they can be readily 
identified using the system disclosed here. Genes which are differentially 
transcribed in the presence of wild-type and mutant CTNNB1, for example, 

10 can be identified. 

Tcf-responsive reporter genes are those constructs which comprise 
a readily detectable or assayable gene (such as luciferase, P- galactosidase, 
chloramphenicol acetyltransferase) linked in cis to a Tcf-responsive element. 
Such responsive elements are known in the art (7) and any such elements 

15 can be used. An optimal Tcf motif contains the sequence CCTTTGATC. 

From one to twenty copies, and preferably from three to six copies, of the 
motif may be used. Mutation of the sequence to CCTTTGGCC abrogates 
responsiveness. Another necessary part of such constructs is a minimal 
promoter, such as the c-Fos or the Herpes virus thymidine kinase promoter. 

20 Transcription of the reporter gene may be performed by any means known 

in the art, usually by assaying for the activity of the encoded gene, although 
immunological detection methods can also be used. In addition, 
transcription can be monitored by measuring the transcribed mRNA directly, 
typically using oligonucleotide probes. 

25 As shown below, a cell which has a wild-type APC protein will 

inhibit CRT. However, most mutations in APC render APC unable to 
inhibit CRT. Similarly, certain mutations in CTNNB1 render P-catenin 
super-active and/or refractory to the inhibition by APC. Thus measuring 
Tcf-responsive reporter gene transcription is an indication of the status of 
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A PC and CTNNB1. Mutations in both of these genes are associated with 
cancers and therefore provides diagnostic and prognostic information* 

Assays for CRT can be accomplished in vitro or in cells. If the 
assay is to be accomplished in cells, then a Tcf-responsive reporter gene 
must be introduced into the cell. Any means for introducing genetic 
material into cells can be used, including but not limited to infection, 
transfection, electroporation. If the assay is to be performed in vitro then 
the components for transcription must be present. These include suitable 
buffers, RNA polymerase, as well as ribonucleotides. If the protein product 
is to be assayed, then the components for translation must also be present, 
such as ribosomes, and amino acids. 

These assays can also be used to screen compounds for potential 
as anti-cancer therapeutic agents. Using either the in vitro or cell form of 
the assay, test compounds can be introduced to determine whether they are 
able to mimic the effect of wild-type APC or to convert a mutant APC into 
a form which is able to inhibit CRT or a mutant P-catenin into a form which 
is regulatable by APC. In addition, compounds can be tested for the ability 
to inhibit the binding of P-catenin and Tcf-4, thus mimicking the action of 
APC. Such a test can be conducted in vitro or in vivo, for example using a 
two hybrid assay. 

A means for diagnosis of cancers is the result of the observation that 
CTNNB1 mutations are found in tumor cells, especially those which have 
wild-type APC. Such mutations can be found, inter alia, by sequencing 
either the gene or the protein found in a sample. Functional assays can also 
be used, such as whether P-catenin binds to APC or Tcf-4, or whether it is 
capable of mediating CRT. Sequences can be compared to those found in 
a normal tissue of a human, especially the same human who provided the 
sample being tested. Suitable tumors for testing include, but are not limited 
to those which are associated with FAP. Suitable tumors include colorectal 
cancer, thyroid cancer, brain cancer, medulloblastoma, desmoid tumor, 
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osteoma, breast cancer, and head and neck cancer. Because APC mutations 
are so frequent, and because it appears that APC mutations do not occur in 
the same tumors as CTNNB1 mutations, one can prescreen samples for APC 
mutations before performing a CTNNB1 determination. 

5 Hie portion of the APC gene which encodes the P-catenin binding site 

can be used in a gene therapy format Suitable techniques are known in the 
art for administering genes to tumors, and any such technique can be used. 
Suitable expression vectors are also known in the art and it is within the skill 
of the artisan to select an appropriate one. Upon expression in a tumor cell 

10 of the P-catenin binding portion of APC, P-catenin will be bound and 

titrated away from binding to Tcf-4, thus preventing unregulated expression 
of the CRT target genes. Similarly, a polypeptide portion of APC 
containing the P-catenin binding site can be administered to cells to perform 
a titration of p-catenin. Techniques for such administration to cells is well 

15 known in the art. Cells which are treated with either the polynucleotide or 

the polypeptide can be used to study the interaction between APC and P- 
catenin, and for developing drugs which interfere with such binding. 

The above disclosure generally describes the present invention. A 
20 more complete understanding can be obtained by reference to the following 

specific examples which are provided herein for purposes of illustration 
only, and are not intended to limit the scope of the invention. 

EXAMPLE 1 

This example identifies Tcf-4 as the expressed family member in 
25 colorectal epithelial cells and provides the complete sequence of the cloned 

cDNA. 

There are four known members of the Tcf/Lef family in mammals: the 
lymphoid-specific factors Tcf- I and Lef- 1 (7,8), and the less well 
characterized Tcf-3 and 4(9). We performed a qualitative Reverse 
30 Transcriptase-Polymerase Chain Reaction assay for expression of the four 
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Tcf/Lef genes on 43 colon tumor cell lines. While most colon cell lines 
expressed more than one of the genes, only hTcf-4 mRNA was expressed in 
essentially all lines. 

We then screened a human fetal CDNA library and retrieved clones 

5 encoding full-length hTcf-4 (Fig. 1). A genomic fragment encoding, the 

HMG box region of hTcf-4 (7) was used to probe a human 12 week-fetal 
cDNA library in Lambda GT-11. Positive clones were subcloned into 
pBluescriptSK and sequenced. See SEQ ID NOs: 1 and 3. The predicted 
sequence of hTcf-4 was most similar to that of hTcf-1. Alternative splicing 

10 yielded two COOH-termini that were conserved between hTcf- 1 and hTcf-4. 

The NH^ terminus, which in hTcf- 1, mLef-1 and Xenopus TCF-3 mediates 
binding to P-catenin (6), was also conserved in hTcf-4. Northern blot 
analysis of selected colon carcinoma cell lines revealed high-level expression 
of hTcf-4 (Fig. 2A). Northern blot hybridizations (7) were performed with 

15 full-length hTcf-l 9 hLef-I and hTcf-4 cDNA. Colon epithelial cells were 

freshly prepared from a mucosal preparation dissected from a healthy 
surgical colon sample. The sample was minced, and incubated with 1 mM 
dithiothreitol (DTT) in Hanks' medium to remove mucus. Single-cell 
suspensions were prepared by incubation at RT in 0.75 mM EDTA in 

20 Hanks' medium. Epithelial cells were separated from lymphocytes by 

Percoll gradient centrifugation. 

As evidenced by in situ hybridization (Fig. 2, B and C) and Northern 
blotting (Fig. 2A), hTcf-4 MRNA was readily detectable in normal colonic 
epithelium, whereas hTcf I and hLeM were not detectable. In situ 

25 hybridization of 6 ft frozen sections of healthy colon biopsy samples were 

performed as described(lO). hTcf-4 cDNA encoding amino acids 200 to 310 
was amplified and labeled with Dig-ll-dUTP (Boehringer Mannheim, 
Germany) by PCR. After hybridization and washing, the sections were 
sequentially incubated with mouse anti-Dig antibody (Boehringer) and a 

30 horseradish peroxidase conjugated rabbit antibody to mouse immunoglobulin 
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(Dako, Glostrup, Denmark) . The signal was visualized with 
diaminobenzidine, which produces a reddish-brown precipitate. Blue 
counterstining was performed with haematoxyline. 



5 EXAMPLE 2 

This example demonstrates the interaction of Tcf-4 and p-catenin and 
their function as a transcriptional activating factor. 

To investigate whether hTcf-4 functionally interacts with P-catenin, we 
used two sets of reporter constructs in a p -catenin-Tcf reporter gene assay 

10 (7). One contained three copies of the optimal Tcf motif CCTTTGATC, or 

three copies of the mutant motif CCTTTGGCC, upstream of a minimal c- 
Fos promoter driven-luciferase expression (PTOPFLASH and 
PFOPFLASH). The second set contained three copies of the optimal motif, 
or three copies of the mutant motif, upstream of a minimal Herpes virus 

15 thymidine kinase promoter driven-Chloramphenicol Acetyl Transferase 

(CAT) expression (PTOPCAT and PFOPCAT, respectively). Reporter gene 
assays were performed as in (7). In brief, 2 x 10 6 cells were transfected 
with plasmids by electrqporation. After 24 hours, cells were harvested and 
lysedin 1 mMDTT, 1 % Triton X-100, 15 % glycerol, 25 mMTrispH 7.8 

20 and 8 mM MgCl 2 , cDNAs encoding Myc-tagged versions of P-catenin and 

hTcf-4 were inserted into the mammalian expression vector pCDNA 
(Invitrogen). PCATCONTROL, encoding the CAT enzyme under the 
control of the SV40 promoter, was purchased from Promega. 

Epitope-tagged hTcf-4 and a deletion mutant lacking, the NH 2 -terminal 

25 30 amino acids (ANhTcf-4) were cloned into the expression vector pCDNA. 

Transient transfections were performed in a murine B cell line (IIA1.6), that 
does not express any of the Tcf genes (6). 

The TOPFLASH reporter was strongly transcribed upon cotransfection 
with the combination of p-catenin and hTcf-4 plasmids, but not with the 

30 individual plasmids or with the combination of P-catenin and ANhTcf-4 
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plasmids. No enhanced transcription was detected in cells transfected with 
the negative control PFOPFLASH (Fig, 3A). These results show that 
interaction of the NH 2 -terminus of hTcf-4 with P-catenin results in 
transcriptional activation. 
5 EXAMPLE 3 

This example demonstrates the functional regulation of CRT 
transcriptional activation by wild-type APC. 

In three APC^ carcinoma cell lines, SW480, SW620 and DLD-1 (Fig. 
3B), the PTOPFLASH reporter was 5-20 fold more actively transcribed than 

10 PFOPFLASH. Importantly, transfection of SW480 cells with the reporter 

gene and an APC-expression vector abrogated the transcriptional activity in 
a dose-dependent manner (Fig. 3B). In contrast APC had no effect on a 
cotransfected internal control (pCATCONTROL), or on the basal 
transcription of PFOPFLASH (Fig. 3B). The use of PTOPCAT and 

15 PFOPCAT instead of PTOPFLASH and PFOPFLASH led to comparable 

observations. Hie constitutive transcriptional activity of Tcf reporter genes 
in APC 7 " colon carcinoma cells was in stark contrast to the inactivity of 
these genes in non-colonic cell lines, including HA 1.6 B cells (Fig. 3A), the 
C57MG breast carcinoma cell line; the Jurkat and BW5147 T cell lines; the 

20 Daudi and NS1 B cell lines; the K562 erythromyeloid cell line; the HeLa 

cervical carcinoma line; the HepG2 hepatoma cell line; 3T3, 3T6, and Rat-I 
fibroblasts; and the kidney derived SV40-transformed COS cell line (7,16). 

EXAMPLE 4 

This example demonstrates that a functional P-catenin-hTcf-4 complex 
25 exists constitutively in APC 7 " cells. 

We used HT29-APC A colon carcinoma cells (12), in which APC is 
controlled by a metallothionein promoter. Induction by Zn ++ restores wild- 
type levels of APC, and leads to apoptosis (12). HT29-Gal cells which 
carry a Zn ++ -inducible LacZ gene were used as a control. The only Tcf 
30 family member expressed in HT29 is hTcf-4 (Fig. 2C). In nuclear extracts 
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firom uninduccd HT29 derived transfectants, we readily detected hTcf-4 by 
gel retardation (Fig. 4). An additional band of slightly slower mobility was 
also observed. The addition of a P-catenin antibody resulted in the specific 
retardation of the latter band, indicating that it represented a P-catenin-hTcf- 
4 complex (Fig. 4) (12). After Zn ++ induction for 20 hours, the P- 
catenin-hTcf-4 complex was diminished sixfold relative to uncomplexed 
hTcf-4 in HT29-APC1, while no significant change was observed in HT29- 
Gal cells (Fig. 4). Importantly, the overall levels of cellular P-catenin do 
not change during the induction period in HT29-APC1 cells (12). 

Gel retardation assays were performed as described elsewhere (7). 
Extracts were prepared from intact nuclei that were washed four times to 
avoid contamination with cytoplasmic P-catenin. As the optimal Tcf/Lef 
probe, we used a double-stranded 15-mer CCCTTTGATCTTACC; the 
control probe was CCCTTTGGCCTTACC. (All oligonucleotides were 
from Isogene, Holland). The P-catenin antibody was purchased from 
Transduction Laboratories (Lexington, KY). A typical binding reaction 
contained 3 /ig nuclear protein, 0. 1 ng radiolabeled probe, 100 ng of dldC, 
in 25 Ad of binding buffer (60 mm KC1, 1 mM EDTA, 1 mM DTT, 10% 
glycerol). Samples were incubated for 20 min at room temperature, 
antibody was added, and the samples incubated 20 min further. 

On the basis of these data, we propose the following model. In 
normal colonic epithelium hTcf-4 is the only expressed member of the Tcf 
family. The interaction of P-catenin with hTcf-4 is regulated by APC. 
When appropriate extracellular signals are delivered to an epithelial cell, p- 
catenin accumulates in a form that is not complexed with GSK3P-APC, and 
that enables its nuclear transport and association with hTcf-4. The HMG 
domain of hTcf-4 binds in a sequence-specific fashion to the regulatory 
sequences of specific target genes; P-catenin supplies a transactivation 
domain. Thus, transcriptional activation of target genes occurs only when 
hTcf-4 is associated with P-catenin. The hTcf-4 target genes remain to be 
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identified. However, the link with APC and catenin suggests that these 
genes may participate in the generation and turnover of epithelial cells. 
Upon loss of wild-type APC, monomelic p -catenin accumulates in the 
absence of extracellular stimuli, leading to uncontrolled transcription of the 
5 hTcf-4 target genes. The apparent denovo expression of other members of 

the Tcf family in some colon carcinoma cell lines might lead to a further 
deregulation of Tcf target gene expression by the same mechanism. The 
control of P-catenin -Tcf signaling is likely to be an important part of the 
gatekeeper function of APC (19), and its disruption an early step in 
10 malignant transformation. 

EXAMPTP, S 

This example demonstrates that mutant APC protein does not 
regulate CRT and that a complete set of 20-AA repeats in APC is required 

15 to mediate inhibition of CRT. 

We tested four APC mutants (Fig. 5A) for their ability to inhibit 
P-catenin/Tcf-iegulated transcription (CRT) in transfection assays. The 
first mutant, APC331A represents a type of mutation found in the germline 
of Familial Adenomatous Polyposis (FAP) patients as well as in sporadic 

20 tumors (15). The APC331A protein is truncated at codon 331, 

amino-terminal to the three 15-amino-acid (AA) P-catenin-binding repeats 
between codons 1020 and 1169. The second mutant, APC1309A, is the 
most common germline APC mutation (15), a 5-bp deletion that produces a 
frameshift at codon 1309 and truncation of the protein. The APC1309A 

25 protein retains the 15-AA P-catenin repeats but lacks the seven 20-AA 

repeats between codons 1323 and 2075 that have been implicated in binding 
and phosphorylation of P-catenin (28). The third mutant, APC1941A, 
represents one of the most distal somatic mutations observed in colorectal 
tumors (25). The APC1941A protein is truncated at codon 1941 and 

30 therefore contains the 15-AA repeats and all but the last two 20-AA repeats. 
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Finally, APC2644A represents a germline mutation resulting from a 4-bp 
deletion in codon 2644. Patients with this type of unusual carboxyl-terminal 
mutation develop few polyps (attenuated polyposis) but have pronounced 
extracolonic disease, particularly desmoid tumors (2(5). 

5 Each of the APC mutants was cotransfected with a CRT reporter into 

the SW480 colorectal cancer cell line. SW480 cells have truncated APC and 
constitutivdy active CRT which can be suppressed by exogenous WT APC. 
Although all four mutants produced comparable levels of APC protein after 
transfection, they varied in their CRT inhibitory activity. The three mutants 

10 found in patients with typical polyposis or cancer were markedly deficient 

in inhibition of CRT (Fig. 5B). The reduced activity of APC1309A and 
APC1941A suggests that p-catenin binding is not sufficient for 
APC-mediated inhibition of CRT and that the complete set of 20- AA repeats 
is required. Interestingly, the inhibitory activity of the APC2644A mutant 

15 associated with attenuated polyposis was comparable to that of WT APC 

(Fig. 5B), suggesting that the DLG-binding domain at the carboxyl-terminus 
of APC is not required for down-regulation of CRT. 



WT and mutant APC constructs (2 /xg) were transfected into 293, 
20 SW480, and HCT116 cells using lipofectamine (GIBCO/BRL, 

Gaithersburg). Protein was harvested 24 hours later and subjected to 
immunoblot analysis with APC monoclonal antibody FE9 (23). In HCT1 16 
and 293 cells, exogenous WT APC comigrated with the endogenous APC. 
In SW480 cells, APC1309A comigrated with the endogenous mutant APC. 
25 In all other cases, the nonfunctional APC constructs (APC331A, APC 

1309A, and APC 1941 A) produced as much or more protein than the 
CRT-functional forms of APC (APC WT and APC 2644 A). 



EXAMPLE 6 
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This example demonstrates that other components of the APC- 
regulatory pathway are affected in some cancer cells. 

We evaluated CRT in two colorectal tumor cell lines (HCT116 and 
SW48) that express full-length APC (Fig. 6A). Both HCT116 and SW48 
displayed constitutively active CRT and, in contrast to cell lines with 
truncated APC (DLD1 and SW480), this activity was not inhibited by 
exogenous WT APC (Fig. 5B, 6B). Other (noncolorectal cancer) cell lines 
expressing WT APC do not display constitutive CRT activity. These 
transfection results suggested that the constitutive CRT in HCT116 and 
SW48 might be due to an altered downstream component of the APC tumor 
suppressor pathway. 



EXAMPTF.7 

This example demonstrates a defect in the gene encoding P-catenin 
in some cancer cells, which affects CRT. 

We evaluated the status of a likely candidate for a downstream 
component of the APC tumor suppressor pathway, p-catenin, in the same 
four lines. All four lines expressed similar amounts of apparently intact 
P-catenin, as assessed by immunoblots (Fig. 7A). However, sequence 
analysis revealed that both HCT116 and SW48 harbored mutations in the 
P-catenin gene (CTNNB1) (Fig. 7B). HCT116 had a 3-bp deletion that 
removed one AA (Ser-45), and SW48 had a C to A missense mutation that 
changed Ser-33 to Tyr. Analysis of paraffin-embedded archival tissue from 
the HCT116 patient confirmed the somatic nature of this mutation and its 
presence in die primary tumor prior to culture. Interestingly, both mutations 
affected serines mat have been implicated in the downregulation of P-catenin 
through phosphorylation by the ZW3/GSK3P kinase in Xenopus embryos 
(Fig. 7C) (27,28). 
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Genomic DNA was isolated from paraffin-embedded normal and 
tumor tissue from the patient from whom the HCT1 16 cell line was derived. 
A 95 bp PCR product encompassing the mutation was then amplified by 
PCR and directly sequenced using THERMOS EQUENASE (Amersham); 
5 The 3 bp deletion was observed in tumor but not in normal tissue- 

To test the generality of this mutational mechanism, we evaluated 
five primary colorectal cancers in which sequencing of the entire coding 
region of APC revealed no mutations (25). Three of these five tumors were 
found to contain CTNNB1 mutations (S45F, S45F, and T44A) that altered 
10 potential ZW3/GSK3P phosphorylation sites (Fig. 7C). Each mutation 

appeared to affect only one of the two CTNNB1 alleles and to be somatic. 

Genomic DNA was isolated from frozen-sectioned colorectal 
cancers and a 1001 bp PCR product containing exon 3 of CTNNB1 was 
then amplified by PCR and direcdy sequenced using TherrnoSequenase 
15 (Amersham). An ACC to QCC change at codon 41 (T41A) and a T£T to 

TJLT at codon 45 (S45F) was observed in one and two tumors, respectively. 

EXAMPLE 8 

This example demonstrates dominant mutations of CTNNB1 that 

20 render CRT insensitive to the effects of WT APC. 

Because the P-catenin mutations were heterozygous, we hypothesized 
that the mutations might exert a dominant effect, rendering a fraction of 
cellular P-catenin insensitive to APC-mediated down regulation. To test this 
notion, we performed gel shift analyses with nuclear extracts from 

25 untransfected HCT1 16 cells. In contrast to noncolorectal cancer cell lines 

with intact APC, HCT116 cells contained a P-catenin/Tcf complex that 
gel-shifted an optimized Tcf-binding oligonucleotide, and this complex 
supershifted with anti-p-catenin (Fig. 8A). We also constructed p-catenin 
expression vectors and compared the biologic activity of the mutant 

30 P-catenin from HCT116 (p-Cat A45)and SW48 (p-Cat S33Y) with that of 
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their WT counterpart. For these experiments, we used the 293 kidney 
epithelial cell line as it is highly transfectable, exhibits low endogenous 
CRT, and contains a high level of endogenous APC (Fig. 6A). In the 
presence of endogenous APC, both mutant p-catenins were at least 6-fold 
5 more active than the WT protein and this activity was inhibited by 

dominant-negative hTcf-4 (Fig. UB). 

Together, these results indicate that disruption of APC-mediated 
regulation of CRT is critical for colorectal tumorigenesis. This is most 
commonly achieved by recessive inactivating mutations of bothy4PC alleles 

10 but, as shown here, can also be achieved by dominant mutations of CTNNB1 

that render CRT insensitive to the effects of WT APC. Our results suggest 
that APC inhibition of CRT requires phosphorylation of P-catenin at 
multiple sites. These potential phosphorylation sites are consistent with the 
known specificity of ZW3/GSK3P (29) a serine kinase that negatively 

15 regulates P-catenin in Xenopus and Drosophila cells (27) and that interacts 

with APC and P-catenin in mammalian cells (23). These results also suggest 
a functional basis for the occasional CTNNBl mutations observed in other 
tumor types (30) and illustrate how a critical pathway in human disease can 
be illuminated by the discovery of mutations in different components of the 

20 pathway. The next step in understanding APC function will be the 

identification of the genes that are activated by hTcf-4/p-catenin complexes 
and inhibited by WT APC. These genes are likely to be related to APC's 
ability to induce apoptosis in colorectal cancer cells (31). 
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SEQUENCE LISTING 
SEQ ID NO: 1 Tcf-4B nucleotide 
Tcf4B 



ATGCCGCAGC TGAACGGCGG TGGAGGGGAT GACCTAGGCG 

CCAACGACGA ACTGATTTCC 
TTCAAAGACG AGGGCGAACA GGAGGAGAAG AGCTCCGAAA 

ACTCCTCGGC AGAGAGGGAT 
TTAGCTGATG TCAAATGGTC TCTAGTCAAT GAATCAGAAA 

CGAATCAAAA CAGCTCCTCC 
GATTCCGAGG CGGAAAGACG GCCTCCGCCT CGCTCCGAAA 

GTTTCCGAGA CAAATCCCGG 
GAAAGTTTGG AAGAAGCGGC CAAGAGGCAA GATGGAGGGC 

TCTTTAAGGG GCCACCGTAT 
CCCGGCTACC CCTTCATCAT GATCCCCGAC CTGACGAGCC 

CCTACCTCCC CAAGCGATCC 
GTCTCGCCCA CCGCCCGAAC CTATCTCCAG ATGAAATGGC 

CACTGCTTGA TGTCCAGGCA 
GGGAGCCTCC AGAGTAGACA AGCCCTCAAG GATGCCCGGT 

CCCCATCACC GGCACACATT 
GTCTCTAACA AAGTGCCAGT GGTGCAGCAC CCTCACCATG 

TCCACCCCCT CACGCCTCTT 
ATCACGTACA GCAATGAACA CTTCACGCCG GGAAACCCAC 

CTCCACACTT ACCAGCCGAC 
GTAGACCCCA AAACAGGAAT CCCACGGCCT CCGCACCCTC 

CAGATATATC CCCGTATTAC 
CCACTATCGC CTGGCACCGT AGGACAAATC CCCCATCCGC 

TAGGATGGTT AGTACCACAG 
CAAGGTCAAC CAGTGTACCC AATCACGACA GGAGGATTCA 

GACACCCCTA CCCCACAGCT 



-26- 

CTGACCGTCA ATGCTTCCGT GTCCAGGTTC CCTCCCCATA 

TGGTCCCACC ACATCATACG 
CTACACACGA CGGGCATTCC GCATCCGGCC ATAGTCACAC 

CAACAGTCAA ACAGGAATCG 
TCCCAGAGTG ATGTCGGCTC ACTCCATAGT TCAAAGCATC 

AGGACTCCAA AAAGGAAGAA 
GAAAAGAAGA AGCCCCACAT AAAGAAACCT CTTAATGCAT 

TCATGTTGTA TATGAAGGAA 
ATGAGAGCAA AGGTCGTAGC TGAGTGC ACG TTGAAAGAAA 

GCGCGGCCAT CAACCAGATC 
CTTGGGCGGA GGTGGCATGC ACTGTCCAGA GAAGAGCAAG 

CGAAATACTA CGAGCTGGCC 
CGGAAGGAGC GACAGCTTCA TATGCAACTG TACCCCGGCT 

GGTCCGCGCG GGATAACTAT 
GGAAAGAAGA AGAAGAGGAA AAGGGACAAG CAGCCGGGAG 

AGACCAATGG AGAAAAAAAA 
AGTGCGTTCG CTACATACAA GGTGAAGGCA GGTGCCTCAG 

CCCACCCTCT TCAGATGGAA 
GCTTACTAGA TTCGCCTCCC CCCTCCCCGA ACCTGCTAGG 

CTCCCCTCCC CGAGACGCCA 
AGTCACAGAC TGAGCAGACC CAGCCTCTGT CGCTGTCCCT 

GAAGCCCGAC CCCCTGGCCC 
ACCTGTCCAT GATGCCTCCG CCACCCGCCC TCCTGCTCGC 

TGAGGCCACC CACAAGGCCT 
CCGCCCTCTG TCCCAACGGG GCCCTGGACC TGCCCCCAGC 

CGCTTTGCAG CCTGCCGCCC 
CCTCCTCATC AATTGCACAG CCGTCGACTT CTTGGTTACA 

TTCCCACAGC TCCCTGGCCG 
GGACCCAGCC CCAGCCGCTG TCGCTCGTCA CCAAGTCTTT 

AGAATAGCTT TAGCGTCGTG 
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AACCCCGCTG CTTTGTTTAT GGTTTTGTTT CACTTTTCTT 

AATTTGCCCC CCACCCCCAC 
CTTGAAAGGT TTTGTTTTGT ACTCTCTTAA TTTTGTGCCA 

TGTGGCTACA TTAGTTGATG 
5 TTTATCGAGT TC ATTGGTCA ATATTTGACC CATTCTTATT 

TCAATTTCTC CTTTTAAATA 
TGTAGATGAG AGAAGAACCT CATGATTGGT ACCAAAATTT 

TTATCAACAG CTGTTTAAAG 
TCTTTGTAGC GTTTAAAAAA TATATATATA TACATAACTG 
10 TTATGTAGTT CGGATAGCTT 

AGTTTTAAAA GACTGATTAA AAAACAAAAA AAAAAAAAGC 

TTGCGAGGGA TCCCCCGGGA 

SEQ ID NO: 2 Tcf-4B amino acid 
15 Tcf-4B 

MPQLNGGGGDDLGANDELIS 
FKDEGEQEEKS SENSSAERD 
LADVKS SLVNESETNQNSSS 

20 DSEAERRPPPRSESFRDKSR 
ESLEEAAKRQDGGLFKGPPY 
PGYPFIMIPDLTSPYLPKRS 
VSPTARTYLQMKWPLLDVQA 
GSLQSRQALKDARSPSPAHI 

25 VSNKVPVVQHPHHVHPLTPL 
ITYSNEHFTPGNPPPHLPAD 
VDPKTGIPRPPHPPDISPYY 
PLSPGTVGQIPHPLGWLVPQ 
QGQPVYPITTGGFRHPYPTA 

30 LTVNASVSRFPPHMVPPHHT 
LHTTGIPHPAIVTPTVKQES 
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SQSDVGSLHSSKHQDSKKEE 

EKKKPHIKKPLNAFMLYMKE 

MRAKVVAECTLKES AAINQI 

LGRRWHALSRE E Q A K Y Y E L A 

5 RKERQLHMQLYPGWSARDNY 

Or K K K K R K R D K Q P G E T N G E K K 

SAFATYKVKAAASAHPLQME 
A Y 

10 SEQ ID NO: 3 Tcf-4E nucleotide 

Tcf-4E 

GGTTTTTTTT TTTTACCCCC CTTTTTTATT TATTATTTTT 
TTGCACATTG AGCGGATCCT 
15 TGGGAACGAG AGAAAAAAGA AACCCAAACT CACGCGTGCA 

GAAGATCTCC CCCCCCTTCC 
CCTCCCCTCC TCCCTCTTTT CCCCTCCCCA GGAGAAAAAG 

ACCCCCAAGC AGAAAAAAGT 
TCACCTTGGA CTCGTCTTTT TCTTGCAATA TTTTTTGGGG 
20 GGGCAAAACT TTGAGGGGGT 

GATTTTTTTT GGCTTTTCTT CCTCCTTCAT TTTTCTTCCA 

AAATTGCTGC TGGTGGGTGA 
AAAAAAAATG CCGCAGCTGA ACGGCGGTGG AGGGGATGAC 
CTAGGCGCCA ACGACGAACT 
25 GATTTCCTTC AAAGACGAGG GCGAACAGGA GGAGAAGAGC 

TCCGAAAACT CCTCGGCAGA 
GAGGGATTTA GCTGATGTCA AATCGTCTCT AGTCAATGAA 

TCAGAAACGA ATCAAAACAG 
CTCCTCCGAT TCCGAGGCGG AAAGACGGCC TCCGCCTCGC 
30 TCCGAAAGTT TCCGAGACAA 
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ATCCCGGGAA AGTTTGGAAG AAGCGGCCAA GAGGCAAGAT 

GGAGGGCTCT TTAAGGGGCC 
ACCGTATCCC GGCTACCCCT TCATCATGAT CCCCGACCTG 

ACGAGCCCCT ACCTCCCCAA 
5 GCGATCCGTC TCGCCCACCG CCCGAACCTA TCTCCAGATG 

AAATGGCCAC TGCTTGATGT 
CCAGGCAGGG AGCCTCCAGA GTAGACAAGC CCTCAAGGAT 

GCCCGGTCCC CATCACCGGC 
ACACATTGTC TCTAACAAAG TGCCAGTGGT GCAGCACCCT 
10 CACCATGTCC ACCCCCTCAC 

GCCTCTTATC ACGTACAGCA ATGAACACTT CACGCCGGGA 

AACCCACCTC CACACTTACC 
AGCCGACGTA GACCCCAAAA CAGGAATCCC ACGGCCTCCG 

CACCCTCCAG ATATATCCCC 
15 GTATTACCCA CTATCGCCTG GCACCGTAGG ACAAATCCCC 

CATCCGCTAG GATGGTTAGT 
ACCACAGCAA GGTCAACCAG TGTACCCAAT CACGACAGGA 

GGATTCAGAC ACCCCTACCC 
CACAGCTCTG ACCGTCAATG CTTCCGTGTC CAGGTTCCCT 
20 CCCCATATGGTCCCACCACA 

TCATACGCTA CACACGACGG GCATTCCGCA TCCGGCCATA 

GTCACACCAA CAGTCAAACA 
GGAATCGTCC CAGAGTGATG TCGGCTCACT CCATAGTTCA 

AAGCATCAGG ACTCCAAAAA 
25 GGAAGAAGAA AAGAAGAAGC CCCACATAAA GAAACCTCTT 

AATGCATTCA TGTTGTATAT 
GAAGGAAATG AGAGCAAAGG TCGTAGCTGA GTGCACGTTG 

AAAGAAAGCG CGGCCATCAA 
CCAGATCCTT GGGCGGAGGT GGCATGCACT GTCCAGAGAA 
30 GAGCAAGCGA AATACTACGA 
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GCTGGCCCGG AAGGAGCGAC AGCTTCATAT GCAACTGTAC 

CCCGGCTGGT CCGCGCGGGA 
TAACTATGGA AAGAAGAAGA AGAGGAAAAG GGACAAGCAG 

CCGGGAGAGA CCAATGAACA 
5 CAGCGAATGT TTCCTAAATC CTTGCCTTTC ACTTCCTCCG 

ATTACAGACC TCAGCGCTCC 
TAAGAAATGC CGAGCGCGCT TTGGCCTTGA TCAACAGAAT 

AACTGGTGCG GCCCTTGCAG 
GAGAAAAAAA AAGTGCGTTC GCTACATACA AGGTGAAGGC 
10 AGCTGCCTCAGCCCACCCTC 

TTCAGATGGA AGCTTACTAG ATTCGCCTCC CCCCTCCCCG 

AACCTGCTAG GCTCCCCTCC 
CCGAGACGCC AAGTCACAGA CTGAGCAGAC CCAGCCTCTG 

TCGCTGTCCC TGAAGCCCGA 
15 CCCCCTGGCC CACCTGTCCA TGATGCCTCC GCCACCCGCC 

CTCCTGCTCG CTGAGGCCAC 
CCACAAGGCC TCCGCCCTCT GTCCCAACGG GGCCCTGGAC 

GTGCCCCCAG CCGCTTTGCA 
GCCTGCCGCC CCCTCCTCAT CAATTGCACA GCCGTCGACT 
20 TCTTGGTTAC ATTCCCACAG 

CTCCCTGGCC GGGACCCAGC CCCAGCCGCT GTCGCTCGTC 

ACCAAGTCTT TAGAATAGCT 
TTAGCGTCGT GAACCCCGCT GCTTTGTTTA TGGTTTTGTT 

TCACTTTTCT TAATTTGCCC 
25 CCCACCCCCA CCTTGAAAGG TTTTGTTTTG TACTCTCTTA 

ATTTTGTGCC ATGTGGCTAC 
ATTAGTTGAT GTTTATCGAG TTCATTGGTC AATATTTGAC 

CCATTCTTAT TTCAATTTCT 
CCTTTTAAAT ATGTAGATGA GAGAAGAACC TCATGATTGG 
30 TACCAAAATT TTTATCAACA 
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GCTGTTTAAA GTCTTTGTAG CGTTTAAAAA ATATATATAT 

ATACATAACT GTTATGTAGT 
TCGGATAGCT TAGTTTTAAA AGACTGATTA AAAAACAAAA 

AAAA 

5 

SEQ ID NO: 4 Tcf-4E amino acid 
Tcf4-E 

MPQLNGGGGDDLGANDELIS 

10 FKDEGEQEEKS SENS SAERD 

LADVKSSLVNESETNQNSSS 
DSEAERRPPPRSESFRDKSR 
ESLEEAAKRQDGGLFKGPPY 
PGYPFIMIPDLTSPYLPNGS 

15 VSPTARTYLQMKWPLLDVQA 
GSLQSRQALKDARSPSPAHI 
VSNKVPVVQHPHHVHPLTPL 
ITYSNEHFTPGNP P P H L PAD 
VDPKTGIPRPPHPPDISPYY 

20 PLSPGTVGQIPHPLGWLVPQ 
QGQPVYPITTGGFRHPYPTA 
LTVNASVSRFPPHMVPPHHT 
LHTTGIPHPAIVTPTVKQES 
SQSDVGSLHSSKHQDSKKEE 

25 EKKKPH1KKPLNAFMLYMKE 
MRAKVVAECTLKESAAINQI 
LGRRWHALSREEQAKYYELA 
RKERQLHMQLYPGWSARDNY 
GKKKKRKRDKQPGETNEHSE 

30 CFLNPCLSLPPITDLSAPKK 

CRARFGLDQQNNWCGPCRRK 
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KKCVRYIQGEGSCLSPPSSD 

GSLLDSPPPSPNLLGSPPRD 

AKSQTEQTQPLSLSLKPDPL 

AHLSMMPPPPALLLAEATHK 

ASALCPNGALDLPPAALQPA 

APSSSIAQPSTSWLHSHSSL 

AGTQPQPLSLVTKSLEU 

SEQ ID NO: 5 Tcf-responsive optimal motif probe 
CCCTTTGATCTTACC 
SEQ ID NO: 6 Tcf mutant motif 
CCCTTTGGCCITACC 
SEQ ID NO: 7 APC amino acid 

10 20 30 40 50 60 
MAAASYDQLL KQVEALKMEN SNLRQELEDN SNHLTKLETE 
ASNMKEVLKQ LQGSIEDEAM 
70 80 90 100 110 120 
ASSGQIDLLE RLKELNLDSS NFPGVKLRSK MSLRSYGSRE 
GSVSSRSGEC SPVPMGSFPR 
130 140 150 160 170 180 
RGFVNGSRES TGYLEELEKE RSLLLADLDK EEKEKDWYYA 
QLQNLTKRID SLPLTENFSL 
190 200 210 220 230 240 
QTDMTRRQLE YEARQIRVAM EEQLGTCQDM EKRAQRRIAR 
IQQffiKDILR IRQLLQSQAT 
250 260 270 280 290 300 
EAERSSQNKH ETGSHDAERQ NEGQGVGEIN MATSGNGQGS 
TTRMDHETAS VLSSSSTHSA 
310 320 330 340 350 360 
PRRLTSHLGT KVEMVYSLLS MLGTHDKDDM SRTLLAMSSS 
QDSCISMRQS GCLPLLIQLL 
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370 380 390 400 410 420 
HGNDKDSVLL GNSRGSKEAR ARASAALHNI fflSQPDDKRG 
RREIRVLHLL EQIRAYCETC 
430 440 450 460 470 480 
5 WEWQEAHEPG MDQDKNPMPA PVEHQICPAV CVLMKLSFDE 

EHRHAMNELG GLQAIAELLQ 
490 500 510 520 530 540 
VDCEMYGLTN DHYSITLRRY AGMALTNLTF GDVANKATLC 
SMKGCMRALV AQLKSESEDL 
10 550 560 570 580 590 600 

QQVTASVLRN LSWRADVNSK KTLREVGSVK ALMECALEVK 
KESTLKSVLS ALWNLSAHCT 
610 620 630 640 650 660 
ENKADICAVD GALAFLVGTL TYRSQTNTLA IffiSGGGILR 
15 NVSSLIATNE DHRQILRENN 

670 680 690 700 710 720 
CLQTLLQHLK SHSLTTVSNA CGTLWNLSAR NPKDQEALWD 
MGAVSMLKNL IHSKHKMIAM 
730 740 750 760 770 780 
20 GSAAALRNLM ANRPAKYKDA NIMSPGSSLP SLHVRKQKAL 

EAELDAQHLS ETFDNIDNLS 
790 800 810 820 830 840 
PKASHRSKQR HKQSLYGDYV FDTNRHDDNR SDNFNTGNMT 
VLSPYLNTTV LPSSSSSRGS 
25 850 860 870 880 890 900 

LDSSRSEKDR SLERERGIGL GNYHPATENP GTSSKRGLQI 
STTAAQIAKV MEEVS AIHTS 
910 920 930 940 950 960 
QEDRSSGSTT ELHCVTDERN ALRRSSAAHT HSNTYNFTKS 
30 ENSNRTCSMP YAKLEYKRSS 

970 980 990 1000 1010 1020 
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NDSLNSVSSS DGYGKRGQMK PSIESYSEDD ESKFCSYGQY 
PADLAHKIHS ANHMDDNDGE 
1030 1040 1050 1060 1070 1080 
LDTPINYSLK YSDEQLNSGR QSPSQNERWA RPKHDEDEI 
5 KQSEQRQSRN QSTTYPVYTE 

1090 1100 1110 1120 1130 1140 
STDDKHLKFQ PHFGQQECVS PYRSRGANGS ETNRVGSNHG 
INQNVSQSLC QEDDYEDDKP 
1150 1160 1170 1180 1190 1200 
10 TNYSERYSEE EQHEEEERPT NYSDCYNEEK RHVDQPIDYS 

LKYATDIPSS QKQSFSFSKS 
1210 1220 1230 1240 1250 1260 
SSGQSSKTEH MSSSSENTST PSSNAKRQNQ LHPSSAQSRS 
GQPQKAATCK VSSINQETIQ 
15 1270 1280 1290 1300 1310 1320 

TYCVEDTPIC FSRCSSLSSL SSAEDEIGCN QTTQEADSAN 
TLQIAEIKEK IGTRSAEDPV 
1330 1340 1350 1360 1370 1380 
SEVPAVSQHP RTKSSRLQGS SLSSESARHK AVEFSSGAKS 
20 PSKSGAQTPK SPPEHYVQET 

1390 1400 1410 1420 1430 1440 
PLMFSRCTSV SSLDSFESRS IASSVQSEPC SGMVSGESP 
SDLPDSPGQT MPPSRSKTPP 
1450 1460 1470 1480 1490 1500 
25 PPPQTAQTKR EVPKNKAPTA EKRESGPKQA AVNAAVQRVQ 

VLPDADTLLH FATESTPDGF 
1510 1520 1530 1540 1550 1560 
SCSSSLSALS LDEPFIQKDV ELRIMPPVQE NDNGNETESE 
QPKESNENQE KEAEKTIDSE 
30 1570 1580 1590 1600 1610 1620 
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KDLLDDSDDD DIEILEECII SAMPTKSSRK AKKPAQTASK 
LPPPVARKPS QLPVYKLLPS 
1630 1640 1650 1660 1670 1680 
QNRLQPQKHV SFTPGDDMPR VYCVEGTPIN FSTATSLSDL 
HESPPNELA AGEGVRGGAQ 
1690 1700 1710 1720 1730 1740 
SGEFEKRDTI PTEGRSTDEA QGGKTSSVTI PELDDNKAEE 
GDILAECINS AMPKGKSHKP 
1750 1760 1770 1780 1790 1800 
FRVKKIMDQV QQASASSSAP NKNQLDGKKK KPTSPVKPIP 
QNTEYRTRVR KNADSKNNLN 
1810 1820 1830 1840 1850 1860 
AERVFSDNKD SKKQNLKNNS KDFNDKLPNN EDRVRGSFAF 
DSPHHYTPIE GTPYCFSKND 
1870 1880 1890 1900 1910 1920 
SLSSLDFDDD DVDLSREKAE LRKAKENKES EAKVTSHTEL 
TSNQQSANKT QAIAKQPINR 
1930 1940 1950 1960 1970 1980 
GQPKPILQKQ STFPQSSKDI PDRGAATDEK LQNFADENTP 
VCFSHNSSLS SLSDIDQENN 
1990 2000 2010 2020 2030 2040 
NKENEPKET EPPDSQGEPS KPQASGYAPK SFHVEDTPVC 
FSRNSSLSSL SIDSEDDLLQ 
2050 2060 2070 2080 2090 2100 
ECISSAMPKK KKPSRLKGDN EKHSPRNMGG ILGEDLTLDL 
KDIQRPDSEH GLSPDSENFD 
2110 2120 2130 2140 2150 2160 
WKAIQEGANS IVSSLHQAAA AACLSRQASS DSDSILSLKS 
GISLGSPFHL TPDQEEKPFT 
2170 2180 2190 2200 2210 2220 
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SNKGPRILKP GEKSTLETKK IESESKGIKG GKKVYKSLIT 
GKVRSNSEIS GQMKQPLQAN 
2230 2240 2250 2260 2270 2280 
MPSISRGRTMIBflPGVRNSS SSTSPVSKKG PPLKTPASKS 
5 PSEGQTATTS PRGAKPSVKS 

2290 2300 2310 2320 2330 2340 
ELSPVARQTS QIGGSSKAPS RSGSRDSTPS RPAQQPLSRP 
IQSPGRNSIS PGKNGISPPN 
2350 2360 2370 2380 2390 2400 
10 KLSQLPRTSS PSTASTKSSG SGKMSYTSPG RQMSQQNLTK 

QTGLSKNASS IPRSESASKG 
2410 2420 2430 2440 2450 2460 
LNQMNNGNGA NKKVELSRMS STKSSGSESD RSERPVLVRQ 
STFKEAPSP TLRRKLEESA 
15 2470 2480 2490 2500 2510 2520 

SFESLSPSSR PASPTRSQAQ TPVLSPSLPD MSLSTHSSVQ 
AGGWRKLPPN LSPHEYNDG 
2530 2540 2550 2560 2570 2580 
RPAKRHDIAR SHSESPSRLP INRSGTWKRE HSKHSSSLPR 
20 VSTWRRTGSS SSILS ASSES 

2590 2600 2610 2620 2630 2640 
SEKAKSEDEK HVNSISGTKQ SKENQVSAKG TWRKKENEF 
SPTNSTSQTV SSGATNGAES 
2650 2660 2670 2680 2690 2700 
25 KTLIYQMAPA VSKTEDVWVR ffiDCPINNPR SGRSPTGNTP 

PVIDSVSEKA NPNIKDSKDN 
2710 2720 2730 2740 2750 2760 
QAKQNVGNGS VPMRTVGLEN RLNSFIQVDA PDQKGTEIKP 
GQNNPVPVSE TNESSIVERT 
30 2770 2780 2790 2800 2810 2820 
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PFSSSSSSKH SSPSGTVAAR VTPFNYNPSP RKSSADSTSA 
RPSQIPTPVN NNTKKRDSKT 
2830 2840 28S0 2860 2870 2880 
DSTESSGTQS PKRHSGSYLV TSV*KRGRMK LRKFYVNYNC 
5 YIDILFQMKL *KTEKFCK*V 

2890 2900 2910 2920 2930 2940 
♦FLLEGFCSG SHI**YTLSS LVLFWEALLM VRKKIVKPSM 
FVQYVLHVFK VAPIPTSFNY 
2950 2960 2970 2980 2990 3000 
10 CLS*NNEHYR *KI*YIAVIN HF*HN*LNL HQGKIGIYAK KNVF 
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SEQ ID NO: 8 Tcf-IB amino acid 
SEQ ID NO: 9 Tcf-IE amino acid 
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fTATMS 

1. An intron-free DNA molecule encoding Tcf-4 protein as 
shown in SEQ ID NO: 2 or 4. 

2. The DNA molecule of claim 1 which has the nucleotide 
sequence of SEQ ID NO: lor 3. 

3. An isolated Tcf-4 protein, substantially free of other human 
proteins, having a sequence as shown in SEQ ID NO: 2 or 4. 

4. A method of determining the presence or absence in a cell of 
wild-type APC or a downstream protein in the APC transcription 
regulatory pathway, comprising the steps of: 

introducing a Tcf-responsive reporter gene into the cell; and 
measuring transcription of said reporter gene; wherein a cell 
which supports active transcription of said reporter gene does not 
have wild-type APC or a downstream protein in the APC 
transcription regulatory pathway. 

5. A method of determining the presence or absence in a cell of 
wild-type APC, comprising the steps of: 

contacting a Tcf-responsive reporter gene with a lysate of 
the ceil; and 

measuring transcription of said reporter gene; wherein a 
lysate which inhibits said transcription has wild-type APC. 

6. A method of identifying candidate drugs for use in FAP 
patients, patients with APC or P-catenin mutations, or patients 
with increased risk of developing cancer, comprising the steps of: 

contacting a cell having no wild-type APC or a mutant P- 
catenin with a test compound; 
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measuring transcription of a Tcf-responsive reporter gene, 
wherein a test compound which inhibits the transcription of the 
reporter gene is a candidate drug for cancer therapy. 

7. The method of claim 6 wherein the cell produces an APC 
protein defective in P-catenin binding or regulation. 

8. The method of claim 6 wherein the cell produces a P-catenin 
protein which is super-active, or which is defective in APC 
binding or resistant to APC regulation. 

9. The method of claim 6 wherein the cell produces no 
detectable APC protein. 

10. A method of identifying candidate drugs for use in for use in 
FAP patients, patients with APC or P-catenin mutations, or 
patients with increased risk of developing cancer, comprising the 
steps of: 

contacting a Tcf-responsive reporter gene with a test 
compound under conditions in which the reporter gene is 
transcribed in the absence of the test compound; and 

measuring transcription of the Tcf-responsive reporter gene; 
wherein a test compound which inhibits said transcription is a 
candidate drug for cancer therapy. 

11. The method of claim 10 wherein the step of contacting is 
performed in the presence of a lysate of a cell which has no wild- 
type APC. 

12. The method of claim 10 wherein the step of contacting is 
performed in the presence of a lysate of a cell which has a mutant 
P-catenin defective in APC binding or resistant to APC regulation 
or which is super-active. 

13. The method of claim 1 1 wherein the cell produces an APC 
protein defective in P-catenin binding or regulation. 
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14. A method for diagnosing cancer in a sample suspected of 
being neoplastic, comprising the steps of: 

comparing a CTNNB sequence found in the sample to a 
second CTNNB sequence found in a normal tissue, wherein a 
5 difference between the first and second sequence is an indicator of 

cancer. 

15. The method of claim 14 wherein the sequences are 
nucleotide sequences. 

16. The method of claim 14 wherein the sequences are amino 
10 acid sequences. 

17. The method of claim 14 wherein the normal tissue is isolated 
from the same human as the sample. 

18. The method of claim 14 wherein the cancer is selected from 
the group consisting of: colorectal cancer, thyroid cancer, brain 

15 cancer, medulloblastoma, desmoid tumor, osteoma, breast cancer, 

and head and neck cancer. 

19. The method of claim 14 wherein the step of comparing is 
preceded by the step of: 

comparing a first APC sequence found in the sample to a 
20 second APC sequence found in a normal tissue. 

20. The method of claim 19 where no differences are detected 
between the first and second APC sequences. 

21. The method of claim 14 wherein no APC mutations have 
been detected in the sample. 

25 22. A method for treating a patient with colorectal cancer or 

other cancer associated with FAP, the method comprising the step 
of: 

administering to the patient a nucleotide sequence 
comprising a portion of the APC coding sequence, said portion 
30 consisting of the P-catenin binding site. 
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23. A method for treating a patient with colorectal cancer or 
other cancer associated with FAP, the method comprising the step 
of: 

administering to the patient a polypeptide comprising a 
5 portion of the APC coding sequence, said portion consisting of the 

P-catenin binding site. 

24. A method of identifying candidate drugs for use in FAP 
patients or patients with increased risk of developing cancer, 
comprising the steps of: 

10 contacting a test compound with p-catenin and Tcf-4 under 

conditions in which p-catenin and Tcf-4 bind to each other; and 

determining whether the test compound inhibits the binding 
of P-catenin and Tcf-4, a test compound which inhibits the binding 
being a candidate for cancer therapy or prophylaxis. 
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FIGURE 2a 
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FIGURE 5 A 



WO 98/41631 



8/15 



PCT/US98/05506 



Relative CRT (fold) 

O O O O O — ^ — ^ 

o ho 4^ 'g> bo o ho 




FIGURE 5 b 
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FIGURE 6a 
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FIGURE 6 b 
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FIGURE 7 b 
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